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Abstract 

By analyzing the concept of contextuality (Bell-Kochen-Specker) in terms 
of pre-and-post-selection (PPS), it is possible to assign definite values to ob- 
servables in a new and surprising way. Physical reasons are presented for 
restrictions on these assignments. When measurements are performed which 
do not disturb the pre- and post-selection (i.e. weak measurements), then novel 
experimental aspects of contextuality can be demonstrated including a proof 
that every PPS-paradox with definite predictions implies contextuality. Cer- 
tain results of these measurements (eccentric weak values with e.g. negative 
values outside the spectrum), however, cannot be explained by a "classical- 
like" hidden variable theory. 
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1 INTRODUCTION 

A traditional concept of the quantum state | \I/ iri ) is that it generally provides only 
statistical information about the outcome of an Ideal Measurement (IM). Therefore 
many authors have proposed that the quantum state could be "completed" by a 
hidden- variable-theory (HVT). A natural category of HVT assigns definite values to 
all possible observables of a system at all times has a similar relationship to Quantum 
Mechanics (QM) as Classical Mechanics (CM) has to Classical Statistical Mechanics 
(CSM). I.e. QM can be understood in terms of a deeper theory, the HVT. The 
relationship between CM and CSM is relatively simple because an ideal classical 
measurement precisely measures a property of a system, without affecting the system 
under study. Measurement of one property will not interfere with measurement of 
another property (i.e. measurement does not depend on context) and the state of 
the system can then be said to exist when we are not observing it. There is thus a 
simple relation between the theory and underlying physical processes (the outcome 
of measurements directly tells us what values to assign to all variables of the theory). 

Moving over to QM, there are two general constraints on any HVT which repro- 
duces QM: a) the Bell-Kochen-Specker theorem (BKS) |2U El and Gleason's (SSJ 
theorem showed that any HVT must be contextual and b) Bell's theorem [20] , showed 
that any HVT must be non-local. Gleason and BKS proved that one cannot assign 
unique answers to yes-no questions (posed to single systems) in such a way that one 
can think that measurement simply reveals the answer as a pre-existing property 
that was intrinsic solely to the quantum system itself. Some versions of BKS depend 
only on the structure of observables, while some versions also rely on the state and 
we therefore define a "value function" V?{A) (the specification of the HVT) as the 

assignment of a value to an observable A when an individual system is in the state 
ip. BKS assumed that VAA) should satisfy: 

Vj(F{A}) = F{Vj(A)} (1.1) 

That is, any functional relation of an operator that is a member of a commuting 
subset of observables must also be satisfied if one substitutes the values for the ob- 
servables into the functional relations. E.g. if a system is characterized by commuting 
observables A\ and A2 then condition II . II requires that all the relationships or func- 
tions between these operators should also be satisfied when V?{A\) and V^(A 2 ) are 
substituted into the same functional relations. This condition determines the sum 
and product rules: 

Vj{Ay + A 2 ) = VjiAi) + Vj(A 2 ) (1.2) 
^(iii 2 ) = V^VtfM (1.3) 

HVTs which meet these conditions are noncontextual: all yes-no questions can 
be associated with a value assignment Vx which provides a single unique answer, 
irrespective of the set of other commuting yes-no questions that it is associated with. 
BKS showed that attempting such an assignment to some observables is inconsistent: 
in any system (of dimension greater than 2) the 2 n possible "yes-no" assignments (to 
the n projection operators representing the yes-no questions) cannot be compatible 
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with the sum 11.21 and product ll.3l rules for all orthogonal resolutions of the identity. 
E.g., consider a complete set of spectral projectors of an operator A with discrete 
eigenvalues, so that Pi = pA= ai , such that J27=i ^(-^j) = 1- Then only one of the 
projection operators can give a "yes" assignment (Pi = 1) and the rest have to be 
"no" (i.e. Pi — 0). However, ip can be decomposed into many different basis sets, 
and the value that assigns must be independent of the particular basis. BKS 
showed that this cannot be done. 

The principal result of this article is to question whether BKS is just a for- 
mal result (i.e. negative statements concerning the impossibility of a classical-like 
"non-contextual- HVTs" ) or if BKS has new positive aspects including experimental 
consequences. We probe this by utilizing the natural connection between counter- 
factual statements and pre-and-post-selection (PPS). We then connect contextuality 
with issues that can be probed experimentally by weak measurements. PPSs were 
originally probed with the time-symmetric re-formulation of Quantum Mechanics 
(TSQM, introduced by Aharonov, Bergmann and Lebowitz a/k/a ABL f'2 ]). To be 
useful and interesting, any re-formulation of QM should meet several criteria such as 
those met by TSQM: 

• TSQM is consistent with all the predictions made by standard QM, 

• TSQM brings out features in QM that were missed before: e.g., ABL considered 
measurement situations between two successive IMs in which the transition from 
a state \^- m ) (pre-selected at a time t- m ) to a state \^e.n) (post-selected at a 
later time ifi n ) is generally disturbed by an intermediate precise measurement. 
Post-selection reflects a unique aspect of QM in that measurement results are 
not determined by equations of motion and initial conditions. A subsequent 
theoretical development arising out of the ABL work was the introduction of 
the "Weak Value" (WV) of an observable which was probed by a new type 
of quantum measurement called the "Weak Measurement" (WM) j^J. WM 
experiments have been performed and results are in very good agreement with 
theoretical predictions, 

• TSQM lead to simplifications in calculations (as occurred with the Feynman 
re-formulation) and stimulated discoveries in other fields: e.g. ABL influenced 
work in cosmology (e.g. Gell-Mann and Hartle [221); m superluminal tunneling 
(Chiao [22] and Steinberg [51]): in quantum information (e.g. the quantum 
random walk [8] or cryptography [HUES]), etc. 

• TSQM suggests generalizations of QM that were missed before - e.g. a new 
solution [22] to the quantum measurement problem. 

Using TSQM, we show how to assign definite values to sets of BKS observables in 
a new and surprising way. We also show how measurement disturbance can arise in 
a new way when value assignments depend on both the pre- and post-selection. An 
"intriguing" physical reason is presented to explain why this scheme cannot be ap- 
plied to 2 or more IMs: the 2 IMs interfere with each other because some assignment 
of eigenvalues to operators are based on just one of the two vectors (i.e. either the 
pre- or the post-selected vector) while some assignment of eigenvalues are based on 
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both vectors (i.e. both the pre-selected and the post-selected vectors, what we call 
diagonal-PPS). In addition, we show that when measurements are performed which 
do not disturb the PPS (WMs), then novel experimental aspects of contextuality can 
be demonstrated, including a proof that every PPS-paradox with definite predictions 
implies contextuality. We also demonstrate an isomorphism between WVs in BKS 
situations and EPR entanglement. Certain results of WMs (eccentric WVs with 
outside the eigenvalue spectrum), however, cannot be explained by a "classical-like" 
HVT. 



2 PPS, CONTEXTUALITY, AND THE 3-BOX- 
PARADOX 

The 3-box-PPS-paradox ([3], verified experimentally [33]) uses a single quantum 
particle that is placed in a superposition of being in 3 closed, separated boxes. The 
particle is pre- selected in |^ in ) = l/y/3 (\A) + \B) + \C)) , where \A), \B) and \C) 
denote the particle localized in boxes A, B, or C, respectively. The particle is post- 
selected in the state \^a n ) = (\A) + \B) — \C)). If an ideal (i.e. von Neumann) 
measurement is performed on box A in the intermediate time (e.g. we open the box), 
then the probability to find the particle in box A is 1, i.e. Pa = \A) (A\ = 1, given 
by ABL 2 : 



IOPfin,*|aj)(aj 


*in,*>| 2 


Ea'K*fin 


, t\a')(a' 


^in,t)| 2 



Prob(a 3 , t\*- m , t in , vi> fin , u n ) = 1 -y ;. ' / ' , 2 (2.4) 



which in this case yields, ProbiP A = — — ,- rr^rprrTT^^ — a IT ,., = 1. This can 
also be seen intuitively without using ABL: if the particle is not found in box A, 
then the initial state \^m) would be projected onto l/y/2 (\B) + \C)), but this is 
orthogonal to the final state |\l/fi n ). Therefore the particle must be found in box A. 
Similarly, the probability to find the particle in box B is 1, i.e. Prob{P-Q = 1) = 1, 
and this is the essence of the "paradox" . 

Now ProbiP a = 1) = 1 if only box A is opened, while ProbiP-g = 1) = 1 if only 
box B is opened. When we measure both box A and box B, then the particle will not 
be found in both boxes, i.e. PaPb — 0. But Pa and Pb commute with each other, 
so how is it possible that measurement of one box can disturb the measurement 
of another? The reason suggested here is that in order to deduce Pa = 1 from 
ABL (or Pb = 1), we used information from both the pre-and the post-selected 
vectors (a situation we call diagonal-PPS); when we actually measure Pa, then this 
measurement will limit the "propagation" of the 2-vectors that were relied on to 
ascertain intermediate values (see fig. Gib). If we then subsequently were to measure 
Pb, the necessary information from both the pre- and post-selected vectors is no 
longer available (i.e. information from t- m cannot propagate beyond the measurement 

2 The time-symmetry in ABL can already be seen from the Born formula Prob(aj 1 t|\&i n , tin) = 
\(aj\U(t, iin) l^in) | 2 - Instead of | (ai \ exp(—iHAt)^> in ) \ 2 for a particular outcome oi, one may 
equivalently say that the probability is | (aiexp(+iH At) | *i n ) | 2 , i.e. that one applies the time 
evolution operator to evolve a\ from t to t- lrL , which is the time reverse of the first picture. 
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of Pa at time ti due to the disturbance caused by an IM). Thus, even though Pa 
and Pr commute, measurement of one can disturb measurement of the other with 
diagonal-PPSs, and this is related to a violation of the product rule 11.31 (In general, 
if l^i) is an eigenvector of A with eigenvalue a and is an eigenvector of B with 
eigenvalue b and [A, B] = 0, then if A and B are known only by either pre-selection or 
post-selection, then the product rule is valid, i.e. AB = ab. However if A and B are 
known by both pre-selection and post-selection, then the product rule is not valid, 
i.e. AB 7^ ab, i.e. they can still disturb each other, even though they commute. |29j ) 
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a) pre-selected vector |*i n ) = l/v3 (\A) + \B) + \C)) propagates forwards in 



time from t- m to t\, and post-selected vector l^gn) = 1/V3 {\A) + \B) 
backwards in time from t nn to £2- b) IM of at ti and of Pg at £2- 



|C)) propagating 



2.1 Weak Measurements 

WMs can be quantified in the quantum measurement theory developed by von Neu- 
mann IHEj: to measure an observable A of the system, one may use an interaction 
Hamiltonian of the form H mt = —\(t)QA where Q is an observable of the measuring 
device (MD) and X(t) is a coupling constant which is non-zero only during a short 
time (0, T). Using the Heisenberg equations of motion for the momentum P of MD 
(conjugate to the position Q), we see that P changes according to ^£ = \(t)A. In- 
tegrating this, we see that P(T) - P(0) = \A where J T X(t)dt = X. To make a more 
precise determination of A requires that either a) P(0) and P(T) are more precisely 
defined or b) A is large. 

A WM can be characterized by either a) P of MD is measured to a finite precision 
AP, (which limits the disturbance by a finite amount AQ > 1/AP) or b) small A. 
After the WM interaction, the system is post-selected. In this regime, the measure- 
ment becomes less precise because the uncertainty AP in the position of the pointer 
is larger than the difference in the shifts of the pointer Aa« corresponding to the 
different eigenvalues and thus the shift in MD is much smaller than its uncertainty. 
The simplest derivation of the WV result is with the second approach, i.e. A small 
(f\(t)dt = A « 1): 

OMe-^l^) w (^ fin | V in ){l + i\QA w ) w (^ fin |^ in )e^ A ^ (2.5) 

The final state of MD is almost un entangled with the system and is shifted by the 
WV A w (assuming without lack of generality that the state of the MD is a Gaussian 
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with spreads A = AP = AQ = 1): 

ICOP) - (*to\e^ A \V ia )*g D {P) = exp{-(P-\A w ) 2 } (2.6) 
Wh ere A, = <*5^> (2.7) 



For the PPS cases considered, we do not need to perform the above calculation 
to obtain the WV. Instead, we can easily ascertain the WVs due to the following 
theorems: 

Theorem 1: The sum of the WVs is equal to the WV of the sum: 

if (P A ) W = (P fl + P c ) w then (P A ) W = (P B ) W + (P c ) w (2.8) 
Proof: this follows simply from the linearity of the operators 
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Theorem 2: If a single IM of an observable Pa is performed between the pre- and 
the post-selection, then if the outcome is definite (e.g. Prob{pA = 1)=1) then the 
WV is equal to this eigenvalue (e.g. (Pa)w = 1) |E]- 

Proof: Given that Pa = J2 n a n\ a n)(oi n \, if an eigenvalue, e.g. Pa = a n , is obtained 
with certainty, then for n ^ m, Pa = |a m )(a! m | = because the probability to 
obtain another eigenvalue by ABL is oc (^finl^m) (&m\^m) = 0. In this case, the WV 
(Pa) w = (\a m )(a m \) w = { ^%:^:f in) = 0. In addition, 



J2 \^finl - 1 (2 10) 







*in) 




*in) 



because Y^mWm) •{ot m \ — 1- But since {^^ n \a m ) (a m \^- m ) = for n ^ m, the only 
term left is n. Therefore, the WV is 1, the same as the ideal value. 

From Theorem 2, we know the following WVs in the 3-box-paradox with certainty: 

(P A ) W = 1, (P B ) W = 1, PtotaI = (PA + PB + Pc)w = l- (2-11) 

Using theorem 1, we obtain: 





P 'total ~ J 


Pa-Pb 






*in) 



\<* finikin/ 

= (P a + P b + P c )w-(Pa)w-(Pb)w = -1. (2.12) 

This surprising theoretical prediction of TSQM has been verified experimentally us- 
ing photons jSS]. What interpretation should be given to (Pc)w = — 1? One may 
speculate for formal reasons that this corresponds to a "negative probability." How- 
ever, as will be shown subsequently, this interpretation cannot have any experimental 
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meaning. On the other hand, we can give it a different interpretation that does have 
an experimental meaning if we perform any WM which is sensitive to the projection 
operator Po In this case, we will observe the opposite effect from those cases in 
which the projection operator is positive. This suggests that there is —1 particle in 
box C, e.g. a WM of the amount of charge in box C in the intermediate time will 
yield a negative charge (assuming it is a positively charged particle). 

2.2 Contextuality and P PS- paradoxes 

In j3U] it was first pointed out and extensively discussed and later proven by Leifer 
and Spekkens [21], that whenever there is a logical-PPS-paradox (as in the 3-box- 
paradox), then there is a related proof of contextuality. The proof considers "all the 
measurements defined by the PPS-paradox -the preselection, the post-selection, and 
the alternative possible intermediate measurements-as alternative possible measure- 
ments at a single time." They show a direct connection between each of the 8 vectors 
in the 3-box-paradox and the 8 vectors in the Clifford/BKS-proof: it is readily seen 
(|26j's figure) that no HVT assignment of O's (black circles) and l's (white circles) 
can be made that is consistent with the orthogonality relations, since in order for 
non-contextuality to hold, no 2 orthogonal pairs can be white, which is violated by 
the central 2 circles. 

On the one hand, we will show how definite values can be assigned to observables 
(verified by IMs on PPSs) in surprising ways. While this assignment suggests novel 
connections between what could be said about the state before the IM and after, 
in general, the IM creates a disturbance and thus creates an uncertain relationship 
between the state before and after, reflecting Bub and Brown's jUj concern "en- 
sembles which have been preselected and post-selected via an arbitrary intervening 
measurement... are not well defined without specification of the intervening measure- 
ment." However, this is not the case for WMs (PPS ensembles are well-defined for 
any intermediate WM). In addition, we show [20] how measurement disturbance can 
arise in new ways for IMs of commuting observables in PPS situations, and so it was 
also argued |3lH 123] that a noncontextual-HVT can reproduce QM if we allow for 
a disturbance of the HVs: "the possibility of measurement disturbance blocks the 
conclusion that a PPS-paradox is itself a proof of the contextuality of HVTs." The 
motivation [25] behind this assertion was the belief that PPS-paradoxes could be ex- 
plained entirely within CM [301 and that contextuality should not be regarded as 
fundamental in a classical picture of reality. Nevertheless, the "paradoxical," i.e. non- 
classical, nature of the 3-box-paradox was recently re-affirmed in terms of IMs jH] . 
In this article, we expand this point mainly by showing how non-classical WVs can 
be empirically demonstrated in PPS-paradoxes. E.g. WMs on box C will record the 
"paradoxical," non-classical outcome of —1. WVs like this outside the eigenvalue 
spectrum cannot be reproduced from any positive definite probability distribution of 
eigenvalues [2HII11 EDI- 

Finally, with WMs, there is no measurement disturbance, yet non-classical re- 
sults are still obtained. In £ 12.2.21 we show how WMs allow us to make a stronger 
connection between PPSs and contextuality. This weakens the assertion that j2H] 
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"PPS paradoxes do not require contextuality for their explanation but do require 
measurement-disturbance." 



2.2.1 HVTs and PPSs: 

In quantum states are given as probability distributions \i over HVTs A, such 
that J n n(X)d\ = 1, where Q is the set of possible HVTs and measurements are char- 
acterized by idempotent indicator functions Xj 1 '■ ^ ~ > {0, 1}, such that J2j X^fW 
1 . The probability to obtain an outcome j for a random variable X when a measure- 
ment M is performed on a system in \x then is 

p,(X M = j) = jf x f (A)//(A)dA (2.13) 

In addition, a transition from uo to some other HVT state A could result from the 
measurement process [2H], and this is modeled by a transition probability D^(X,uj) 
such that J n Dj(\, uS)d\ = 1. This approach to HVTs may then be applied [2B| to 
ABL: 

r, -HA A A/H - I n X P ost(X)T f(X,uj)fi pie (uj)dujdX 

h X P ost(A)(rf (A, u) + r™ fe (A, w))/x pre (w)c?wc?A 

(2.14) 

WMs and WVs can also be described in this language (the usual projective measure- 
ment typically utilized in quantum experiments are a special case of WMs jlH]) an d 
this will be developed in a subsequent article jlH]- Some general considerations on 
possible relationships between HVTs and WVs are: 

• Epistemic nature of probability: If the probabilistic nature of I^(A, oS) 
and Xi'(A) are of epistemic origin, then this HVT approach [23] requires that 
non-commuting observables (such as p and x) can have a "simultaneous" pre- 
cise reality, as suggested, e.g. by the Wigner-Moyal method. If we require 
that any theoretical formalism should include exactly what can be measured 
(no more and no less), then it should be possible to make measurements on 
these projections. While such densities do give the correct average of a func- 
tion, i.e. / p(x,p)f(x,p)dxdp (thus appear to behave as proper densities), they 
also have un-physical aspects, i.e. mathematical artefacts, when the densities 
become negative. The reason (as will be shown subsequently j3Hl) is that if we 
attempt to actually measure such "negative" properties, then the result does 
not correspond to a physical observable in Hilbert Space. E.g. if we did try 
to project on p and x as densities simultaneously, then we obtain the parity 
operator, taking a generic ip(x) to ip(—x). To see this, we translate the classical 
projection p = and x = into QM: 

oo /*oo roc roc ia g 

e iax+i(3p dadp / / eze ia& e l/3 PdadP (2.15) 



i/3p 



oo J —oo J —oo J —oo 

QM 



Consider applying this to a generic wavefunction. First, the exponential, e 
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translates ip(x). Integrating then over a produces a delta function: 



oo poo 



■ a/3 ■ 



c ip(x + [3)dadf3 



>ip(x + p)dp (2.16) 



Finally, integrating over f3, we obtain (3 = —2x, and thus i/j(x — 2x) = ip(—x). 
Therefore, the quantum analog of the classical projection does not correspond 
to a quantum projector: it corresponds to a highly non-local result, the parity 
operator. 

Ontological nature of probability: In contrast to this non-physical aspect, 
non-classical WVs (e.g. negative WVs for projection operators) can be seen 
experimentally. There are additional advantages besides this greater harmony 
between measurements and theory j^U]- E.g. if the quantum/probabilistic na- 
ture of -Dj^A, uj) is taken to be fundamental, then a natural/axiomatic explana- 
tion can be obtained [IB] for 2 seemingly opposite situations: a) the contextual 
dependence of value assignments on details of MD and b) the fact that the 
statistics do not depend on the details of MD. These can be harmonized as a 
consistency check with causality [IB] as indicated by inserting a complete set 
of states {|^fi n )i} into A: 



El*fl»>i<* 



&n\j 





3 


A 


*in> 




3 



(2.17) 



Thus, one can think of A for the whole ensemble as being built out of pre- and 
post-selected states in which the WV is multiplied by a probability for post- 
selection. The fluctuation in the system then is also relevant for the probability 
to obtain different post-selections: as the fluctuation in the system increases, 
the probability of a rare or eccentric post-selection also increases. However 
an attempt to see this through WMs will require the spread in the MD to be 
increased and this increases the probability of seeing the strange result as an 
error of the MD. 

Time-symmetry versus time-asymmetry: The HVT approach utilized 
in (22] focuses on the probability of post-selection and brings in an element of 
time-asymmetry, apparently endemic to a Bayesian approach [T4j . 



2.2.2 Logical-PPS-paradoxes imply contextuality through WMs: 

A logical-PPS-paradox occurs if Phvt = 1 for several incompatible situations. This 
can occur if 

^X pos t(X)T™(\,u;)ti pie (u;)dwd\ = (2.18) 
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However, Leifer and Spekkens argue that the elements in the PPS-paradox have 
nothing to do with contextuality because their interpretation is different: "..we did 
not show that a logical PPS paradox is itself a proof of contextuality; measurements 
that are temporal successors in the PPS paradox must be treated as counterfactual 
alternatives in the proof of contextuality. This distinction is critical, since an earlier 
measurement can cause a disturbance to the ontic state that is monitored by a later 
measurement". However, WMs (instead of IMs) can give an empirical meaning to 
"alternative possible measurements at a single time". Although these alternatives 
are usually regarded as counter-factual, they will all be true simultaneously with 
WMs. 

Theorem: Logical-PPS-paradoxes imply contextuality through WMs. Proof: The- 
orem 2 allows us to state that all counterfactual statements which maintain the 
occurence of an outcome with certainty will all be true simultaneously when they 
are measured weakly. Theorem 2 is applicable to the precise elements utilized in the 
contextuality proof [2E]. In addition, given that WVs are by definition independent 
of the type of WM and given eq. 12.171 and since WVs can violate the algebraic con- 
ditions (the product rule) required for BKS and noncontextual-HVTs we have 
now proven that logical-PPS-paradoxes which assign definite probabilities (of or 1) 
via ABL, are in fact proofs of contextuality if all the "alternative possible intermedi- 
ate measurements" are performed weakly and the argument that their interpretation 
is different does not apply. 

We have thereby mitigated the attempt to explain these PPS "paradoxes" as a 
result of disturbance (since there is no disturbance with WMs) and have strength- 
ened j2H] the connection between these "paradoxes" (which we will argue cannot be 
reproduced by a "classical-like" HVT) and contextuality. 

3 PPS AND CONTEXTUALITY IN 4-D 

In the 3-box-paradox, the product of observables was always definite, i.e. PaPb = 
and the proof of contextuality was state dependent. In this section, we consider a 
slightly different situation (4D BKS nonets) in which the product of observables can 
give 2 different values. Except for this difference, this 4D BKS nonet example is sim- 
ilar to the 3-box-paradox in that we shall also analyze them in terms of PPSs thereby 
revealing surprising predictions for IMs and WMs and will demonstrate the identical 
issues of diagonal-PPS measurements, violation of the product rule, contextuality, 
and WMs which cannot be explained by a noncontextual-HVT. 

3.1 Review of Mermin's 4-D BKS theorem 

We consider Mermin's version of BKS with a set of 9 observables. It is intuitive j2H] 
to represent all the "functional relationships between mutually commuting subsets 
of the observables," i.e. V^(F{A}) = F{V^(A)}, by drawing them in fig. 2 and 
arranging them so that all the observables in each row (and column) commute with 
all the other observables in the same row (or column). 
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Figure 2: 4-D BKS example 

Individually, each of the 9 observables depicted in fig. 2 has eigenvalues ±1. A 
noncontextual-HVT requires an assignment of values that is ±1. In addition, eq. 11.11 
requires that the value assigned to the product of all three observables in any row 
or column must obey the same identities that the observables themselves satisfy, i.e. 
the product of the values assigned to the observables in each oval yields a result of 
+1 except in the last column which gives — 1. (The value assignments are given by 
V$i) = (al® J 2 ), V^al) = {I l ®al)... V^ x ) = (&l®o*) ). Computing column 
3 of fig. 2: 



{<^}{<% 2 H<% 2 } 



commute so c 



commute so c 



- 2 " 1 * 2 



-2-2 -1-2 
(T x (T y (T x (T y a z a Z 



a. 



(3.19) 



Computing the product of the observables in the third row, i.e.: 

r ^-1^.21 fi.2i.ll fi.li.2l i.1 i.2i.2 i.1 f i.li.21 '1-1 r -i.2l r i.1 i.2l 

Wx^y\Wx^y\Wz a z\ = °x ^x ^yWz^zl = ^x^y {-^z\Wz z) 



(3.20) 



If the product rule is applied to the value assignments made in the rows, then: 
V^l)Vj{al)Vj{alal) = V^V^V^) = 



while the column identities require: 



(3.21; 



VM)VAvl)VMal) = VM)Vr{al)VMa^ 



y y/ V 



(3.22) 



However, it is easy to show that the 9 numbers V? cannot satisfy all 6 constraints: 
multiplying all 9 observables together gives 2 different results, a +1 when it is done 
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row by row and a —1 when it is done column by column: 

v4*l)Vj(*l)Vj(alal)..Vj(*la 2 z ) = +1 (3.23) 

V^di)Vj(di)Vj(&ffi)..Vj(&ld*) = -1 (3.24) 

There obviously is no consistent solution to eqs. 13 . 241 and 13 . 2 31 since they contain the 
same set of numbers, simply ordered differently. Therefore the values assigned to the 
observables cannot obey the same identities that the observables themselves obey, 
VAF{A}) 7^ F{VV(A)}, and an HVT would have to assign values to observables 
in a way that depended on the choice of which of 2 mutually commuting sets of 
observables that were also chosen to measure, i.e. the values assigned are contextual. 

3.2 ABL, VAA and BKS nonets 

Following Vaidman, Albert, and Aharonov (VAA) j3], Mermin showed how to assign 
a definite value to a single measurement of any one of the nine observables of a 
BKS nonet [22|. He then generalized this to a definite assignment to any one of 16 
observables from the sets of nonets and showed that this assignment cannot be done 
if one attempts to measure (or ascertain) 2 or more of the observables belonging to 
the nonets. He left open the question as to the physical reason for this, stating "I find 
this intriguing" j2Z| To address this, we present a physical reason to demonstrate why 
the VAA scheme cannot be applied to 2 or more measurements by showing that the 2 
measurements interfere with each other given the necessary PPSs. This can be seen to 
be a consequence of TSQM: some assignment of eigenvalues to operators are based on 
just one of the two vectors (i.e. either the pre- or the post-selected vector) while some 
assignment of eigenvalues are based on both vectors (i.e. both the pre-selected and the 
post-selected vectors - what we call diagonal-PPS). In this picture, it is the utilization 
of more than one PPS and the subsequent interference between them that explains 
the violation of the product rule and thus the physical source of the "contextuality" 
(provided that the contradictory statements are obtained by measuring the product of 
observables, rather than by measuring the observables individually, and this is always 
the case). When assignments are not made in the diagonal-PPS sense, then sets of 
commuting observables which are determined entirely by just one vector satisfy 
the BKS function condition V^(F{A}) = F{Vj(A)} Sets of commuting observables 
which are assigned values in the diagonal-PPS sense by using information from both 
vectors do not satisfy the BKS function condition because they violate the product 
rule, and can disturb each other. 



3.2.1 Ascertaining the results of any one of the 9 observables: 

We begin our analysis by considering specific examples of PPS configurations. We 
then utilize ABL [2| and show that choosing different post-selections change which 
triplet of observables violate the product rule. Consider first a pre-selection of a\ = 1 
and al = 1 and a post-selection of fry = 1 and = 1 (see fig. Ela). In this case, 
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Figure 3: PPS states for Mermin example. 



it is easy to see that we can ascertain with certainty any one of the following values 
fr\ — 1, fr 2 — 1, &y — 1 or fr 2 — 1. We also know that we will obtain definite values 
of +1 if we measure any one of the following products of observables: fr x fr 2 = 1 and 
fr y fr 2 = 1, so we must also obtain fr^fr 2 fryfr 2 = +1. 

{ala 2 x a l y a 2 y ) = (*} = l\(fr 2 y = 1^^)^ = l)|a* = 1) = +1 (3.25) 

In addition, we obtain the same results if we switch the sequence of fr\fr\ and fr^fr 2 
because they commute. Any one of the other observables in fig. 2 (i.e. fr x fr 2 and fr 2 fry) 
can also be ascertained with certainty given this PPS. Finally, from the product of 
the 3 observables in column 3 {{o"la 2 x }{aya 2 }{ala 2 z } = -1 and {d- x al}{a y cr 2 } = +1), 
we can deduce that fr\fr 2 z = —1. Similar statements can be made for other PPSs. 
E.g. consider the pre-selection: a\ = 1 and a 2 = 1 and a post-selection of fry = 1 
and fr 2 = I (see fig. [3Jb) In this case, the measurement fr],fryfr 2 fr y = +1 and we can 
deduce from the third row that {frier 2 } = +1. 

Thus, given just one PPS, any single observable can be assigned a definite value, 
even though fr\fr 2 is assigned different values in different PPS. It is precisely because 
of this connection between particular PPSs and different values for fr\fr\ that the 
issue of contextuality arises when we consider products of these observables. 



3.2.2 Ascertaining the results of products of the 9 observables: 

In this section, we ask how many of the products of the 9 observables in fig. 2 
can be ascertained together with certainty. For example, as stated in the previous 
section, the outcome for the product of the first two observables in column 3 of fig. 
2 with the PPS of fig. Ela is a x a 2 a y a 2 = +1. However, if we measure the operators 
corresponding to the first 2 observables of row 3 in fig. 2, i.e. fr x fr 2 fr 2 fr y) given this 
particular PPS shown in fig. Ela, then the sequence of measurements interfere with 
each other (as represented by the slanted ovals in figure 0a). To see this, consider 
that fr x fr 2 fr 2 fr y corresponds to the sequence of measurements represented in figure 
EJa. While the pre-selection of particle 2 is fr 2 = 1 at t; n , the next measurement 
after the pre-selection at t 2 is for fr 2 and only after that a measurement of fr 2 is 
performed at t 3 . Thus, there is no guarantee that the fr 2 measurement at t 3 will give 
the same value as the pre-selected state of fr 2 = 1 or that the fr 2 measurement will 
give the same value as the post-selected state of fr 2 = 1. In TSQM, this is due to 
the disturbance of the 2- vector boundary conditions which is created by the IM: the 
initial pre-selected vector fr 2 = 1 from ti n is "destroyed" when the fr 2 measurement at 
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Figure 4: Time sequence of PPS measurements for Mermin example. 

time t 2 is performed and therefore cannot inform the later a 2 measurement at time 
t^. In other words, with the particular PPS given in fig. Ela andEJa, the operator, 
frxd'yd'x&y depends on information from both the pre-selected vector <j\ — 1, a 2 — 1 
and the post-selected vector <r* = 1, a 2 = 1 in a diagonal-PPS sense. We call this 
diagonal-PPS because a line connecting &l(ti) with cr 2 (t 3 ) will be diagonal or will 
cross the line connecting &y(t 2 ) with ^(^4), where t- m <t±< t 2 ... < £fi n , see fig. Oa). 

These results can also be seen in an actual measurement situation, we consider 
interaction Hamiltonians with coupling terms a].5(t — b 2 5{t — t 2 ), cr 2 5{t — t 2 ) and 



11(^ = 11^' 



m 2 



mi 



1) 
1) 



1 



(3.26) 



Since e l ® ir7xCrx 
outcome of -+ 



commutes with e v 
1 is obtained 

1 _ 1MA-2 



• a y, they can be interchanged and thus the same 
However, for the other observables (<7* = l\(&y = 
l\ e iwiviVy e iW2crya-^L _ — 1} the opposite eigenvalue is obtained (even though 

they commute) i.e. b-].a 2 a 2 a y will give an outcome of —1 given the PPS of fig. EJa 
even though separately b\a 2 = +1 and & 2 a y = +1. This is thus a violation of the 
product rule (see fig. 0a). This diagonal-PPS phenomenon can be generalized to 
functions that are polynomials of products of observables with the proper ordering 
(i.e. no mixing or sandwiching). 





Figure 5: a) Measurement of ala 2 a 2 a y is diagonal, b) measurement of a^a 2 a y a 2 is diag- 
onal 
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To summarize this sub-section, given the PPS of fig. |SJa, the subset of observables 
circled in fig. 6. a (and the products of those circled observables) can be assigned 
eigenvalues in a way that satisfies the function relation requirement eq. 11.11 But, the 
product of the other observables (e.g. a\a 2 and a^aV) can only be ascertained (given 
this particular PPS) using information from both the pre- and post-selected vector 
in a diagonal sense (see fig. 03a), and will thus violate the product rule. With the 
PPS of fig. G3b, the subset in fig. 6.b (and the relevant products of observables) can 
be assigned eigenvalues in a way that satisfies the function relation requirement eq. 
11.11 But, the product of the other observables a\a 2 x and a^a 2 , i.e. a^a^a^a^ violate 
the functi on rule eq. Il.ll 



at 



a: 



~ 1 ~ 2 ~ 2 ~ 1 ^ 1 ~ 2 

<Wt °x°y Tz°z 



1 a 2 




a: 




V x y al 



(b) 

Figure 6: Products of observables that are not disturbed, a) given the PPS of fi£ 
2. a, and b) given the PPS of fig. 2.b 



3.2.3 Ascertaining the results of any one of 16 observables through the 
generalized state: 

As was explained in &3.2.11 definite results for any one of the intermediate measure- 
ments can be obtained for several different PPSs which were complete measurements 
(and thus describable by a wavefunction). In addition, some triplets (products of ob- 
servables) can also be ascertained (see fig. 6) given a particular pre- or post-selection. 
However, for the most general setup considered in this section, no 2 products of ob- 
servables can be ascertained. The general setup considers superpositions of these 
PPSs. In fact, this is required in order to ascertain any one of the 16 observables in 
Mermin's successful generalization of VAA (it is also required to ascertain any one 
of the 9 for general PPSs). Following VAA Mermin showed that the way that 
any one of the 16 values of the 7 BKS nonets (e.g. fig. 2 is one of them, the others 
given by a v a^) 

can be ascertained with certainty is by entangling the 2-particle system repre- 
senting the 4D BKS nonet (represented by the pre and post-selection of fig. EJa and 
fig. EJb, etc., and labeled as with another system, i.e. an ancilla (represented 
here by j^j) and The nonet is prepared at t m by correlating it with a set of 

states of an ancilla: 

|A) = S^I*i>IO (3-27) 
Then the ancilla is "guarded" so there are no interactions with the ancilla during the 
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time (t; n ,tfi n ). At tfi n we post select on the particle and ancilla and obtain the state: 

1 



EI^)N) 



(3.28) 



If we are successful in obtaining this state for the post-selection, then the state of 
the system is described in the intermediate time by the entangled state (see figure 

0) UM 



(*1 



(*2 



* = ^Oi<*< II ®i) 
(*3| 1 



(3.29) 



correlated correlated 



+ 



+ 



\*n) 



|$l) |$ 2 ) 1^3) 

Figure 7: Generalized State: superpositions of 2-vectors given by eq. 13.291 

For general PPSs, we use multiple sets of boundary conditions given by fig. H3a 
and fig. EJb, etc., to get an entangled state represented by fig. |H1 (where for simplicity 
we have taken the states of the ancilla to be an orthonormal set, f M ). Mermin 
then presented an elegant method to determine the states of the ancilla necessary 
to produce the effect: he selected a definite representation for the 2-particle spins, 
performed a projection, P, onto the subspace given by the nonet, and solved under 
rotation for the state of the ancilla. That is, (n|P|A) = for "all but a single one 
of the projections associated with..." [2Zj the observable of the nonet that is to be 
ascertained with certainty. 
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Figure 8: Generalized state for BKS nonets 



As Mermin proved, all 4 components of this generalized state are necessary (i.e. 
we only determine a Bell-state at tfi n on the ancilla rather than make a projection onto 
any given component f M ) to ascertain a definite answer to any one of the individual 
observables. 



3.2.4 A physical reason for restrictions on these assignments 

We have suggested a physical reason based on TSQM and PPS for the 2 different 
values for This points to a physical reason why no 2 measurements can be 
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ascertained with certainty in the intermediate 3 time: all sets of boundary conditions 
are needed (those corresponding to both Ola andElb, etc.) in order to ascertain with 
certainty the value of any one of the 16 observables, as represented by fig. El However, 
when the first observable is ascertained, then it will depend on both the pre- and the 
post-selection measurement (i.e. it will be diagonal-PPS) in 2 of the 4 components 
of the generalized state (see A3.2.2JI and will collapse the entire configuration onto 
a subset of the PPSs, thereby disturbing the terms of the generalized state. Given 
any pair of measurements, there will always be a diagonal-PPS situation when all 4 
components of the generalized state are considered. This can be seen by comparing 
figs. 6. a and 6.b and noting that any 2 observables will not be circled in both. 
Therefore, since we cannot be sure that the entire setup (see fig. IHJ) is not disturbed, 
we cannot ascertain with certainty the outcome for any one of the 16 observables for 
the second measurement. Furthermore, this arrangement is the maximal correlation 
that can be performed (i.e. the 4D state of 2 spins can be maximally correlated 
to another 4D system as performed here), and thus we cannot create an even more 
sophisticated situation with additional ancillas. We have thus given a physical picture 
for Mermin's "intriguing" question: there will always be a diagonal situation for any 
2 observables. 



3.3 Non-classical Weak Values for the 4D BKS nonets 

We can now clarify Mermin's statement: "Alice's other two 'results' have nothing 
to do with any properties of the particle or the results of any measurement actually 
performed." j2Z| While it is certainly true that these "other results" cannot be as- 
certained simultaneously in terms of an IM (as was demonstrated in A3. 2.41 and by 
Mermin), they can be measured simultaneously through WMs. 

The route to an easy calculation of WVs can be established from Mermin's de- 
scription of VAA's accomplishment: "Alice's list gives the observed result for the 
measurement Bob actually made and had he measured anything else it would have 
given the result he observed." [27| This provides a direct route to WMs through the- 
orem 2: WMs will produce the identical result as predicted for the IM since the IM 
results are definite. Thus, the other 'results' are related to properties of the particle 
and can be simultaneously measured. 

We can also obtain non-classical results in this example (similar to the 3-box- 
paradox), by first re- writing the observables of fig. 2 in terms of 3 spin components 
for 2 "virtual" particles: for the first particle, S3 = b\.o^ §2 = and S\ = 

for particle 2, components which commute with particle 1 are S| = o^o^, S| = crl&l, 
and Sf = &l. We can observe a non-classical WV by noting that o\a^alp^ = — 1 
given the PPS of fig. EJa even though separately o\a^ = +1 and o^cr^ = +1, i.e. 
a violation of the product rule and thus a diagonal situation (see fig. Ela). Thus 
WMs must yield the same outcomes, i.e. N(l§3) w = +1 and N(S|) W = +1 but 

3 If we were considering a single PPS, as discussed in ^3.2.11 and i J3.2.2l then some pairs of 
products of observables can be ascertained with certainty, but not any 2 pairs. No 2 products can 
be ascertained in the case of 16 observables and some 2 pairs of products cannot be ascertained for 
the 9 observables. 
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N(SgSg) w = — 1, a non-classical result. To analyze these results, we define the 
following pair occupation operators: 

N++ the projector on the state §3 = 1, and S§ = 1 
N+_ the projector on the state S3 = 1, and S§ = — 1 

N (_ the projector on the state S3 = —1, and S§ = 1 

N the projector on the state S3 = —1, and S§ = —1. 

We can relate these measurements to the 3 box example of §21 but in this case we 
have 2 boxes and 2 particles: N++ means the number of times that particle 1 and 

particle 2 are in the first box, N H means the number of times that particle 1 is in 

the first box and particle 2 is in the second box, etc. 

The WV of the projection operator (1 — S3XI — S|) is — |, a non-classical result. 
From Theorems 1 and 2, we can deduce the following: the different ways to obtain 
N(Sjj) = +1 are given by N ++ (i.e. S\ = 1, and Sj = 1) and N_ + (i.e. S\ = -1, 
and S3 = 1) and therefore we can deduce that: 

N*+ = N++ + N_+ = 1 (3.30) 

In terms of the box analogy, this is how many ways that particle 2 can be found in 
box 1. Also N(Sg) = +1, and thus N(S§) ^ —1 (i.e. how many ways can particle 1 
be found in box 1). This is characterized by: 

N+* = N++ + N+_ = 1 (3.31) 

From SgS| = —1 (which again means that both particles cannot be found in the 
same box), it cannot be that S^S^ = +1 and thus S3 and S3 must be opposite in 
sign. This can be characterized by: 

N ++ + N__=0 (3.32) 

Furthermore, since eq. 13.301 equals eq. 13.311 we can deduce: 

N+_ = N_ + (3.33) 

Subtracting eq. 13.321 from the following identity 

N++ + N__ + N+_ + N_ + = +1. (3.34) 

we obtain: 

N+_ + N_+ = l (3.35) 

Eq. 13.351 implies that the 2 particles are never in the same box. From eq. 13.331 and 
eq. 13.351 we can deduce: 

N_ + = N + _ = 1 (3.36) 
Substituting this value into eq. 13.311 we can deduce that: 

N++ = \ (3-37) 
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Finally, substituting this into eq. 13.321 we can deduce: 

N- = -\. (3.38) 

As shown in [TT] > all these statements can be measured simultaneously through WMs 
and will yield: 

(N_ + ) w = (N + _) w = (N ++ ) w = i (3.39) 

while: 

0*-)w = ~\ (3-40) 

In other words, if a WM is performed on the number of times that particle 1 is in 
the first box and particle 2 is in the first box, then the result is the non-classical 
result — |. Thus, the way that the 2 seemingly contradictory statements o\a\ = ±1 
can weakly "peacefully co-exist" (to paraphrase Abner Shimony) is that a WV goes 
outside the spectrum of possible eigenvalues, i.e. eq. I3.4UI 

In summary, we see again that WMs give an empirical manifestation of BKS: 

• the BKS "contradiction" here is that b\b 2 ,o 1 J .o\ j = — 1 (given the PPS of fig. 
HJa) even though separately b\ai = +1 and a^&y = +1 

• these 3 outcomes can be measured weakly without contradiction because the 
product of WVs is not equal to the WV of the product 

• if BKS were not correct and a noncontextual-HVT were possible, then the 
product rule should be satisfied and an IM of o\o y l o 2 x <j\ j should yield +1. This 
leads to an immediate contradiction because: 

— by theorem 2, the WV must be equal to the ideal result 

— but, this would be inconsistent with an actual WM which will register 

(N ) w 2 1 

• therefore, BKS is empirically consistent with WMs 

We have thus given a physical explanation for why an IM cannot reveal these values, 
while WMs can reveal these values. Thus, with WMs, the BKS "contradiction" still 
exists (i.e. a noncontextual-HVT cannot reproduce QM), yet now it can also be 
measured. In other words, we have physically shown how to obtain 

• +1 for the product of all nine observables when this is performed in the sequence 
of the rows of fig. 2 

• —1 for the product of all nine observables when this is performed in the sequence 
of the columns of fig. 2 

(assuming that the system is PPS and measured weakly). The ambiguity in deter- 
mining whether o\o\ = +1 is obtained or b\a 2 z = —1 is obtained gets shifted to the 
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ambiguity of determining which set of boundary conditions is obtained, i.e. it is now 
a physical property of the system. 

A modification to the experimental setup suggested by jHH] could be used to test 
the predictions made in this paper. This setup considers a 4-D Hilbert space repre- 
sented by two 2-D subsystems, e.g. the path and polarization of a single photon. j2H] 
find a BKS contradiction for a particular entangled state. If the transverse positions 
of each photon is used as the pointer, then WMs can be obtained in with small 
transverse displacements. The postselected photon distribution then determines the 
size of these displacements. Using the technique of jTU], WMs can be performed if 
an optical glass is slightly tilted so that the photon's position is uncertain to within 
the width of the beam. As shown in the next section, the WVs calculated in < j3.3l 
(e.g. eqs. 13.401 and 13. 3 9 J) are identical to WVs for EPR entanglement jHD] and thus 
entanglement in a pre-selected state of 2 particles is isomorphic to an entanglement 
in our 2 virtual particles. 



3.4 Non-classical weak values for EPR and Peres/BKS 

We shall now show that WVs can show new connections between BKS and EPR. Eq. 
13.401 and 13.391 give the same result as calculating WVs for EPR entanglement |3U| 
and thus there is an interesting new kind of isomorphism between the problems of 
WVs in BKS nonets and EPR. We can also consider interesting manifestations of the 
"contextuality" in these situations by making separate measurements on the ancilla. 
Consider a pre-selected state which is entangled between one of the particles of the 
nonet and the ancilla (where, following Mermin, we have chosen the ancilla to be an 
orthonormal set If 1 ): 

\A) = ±={\a\ = +l)|f° = +1) - \a\ = -l)|f° = -1) (3.41) 

If we consider product state post-selections, e.g. with a\ = 1 and f° = +1, then we 
know that an ideal measurement of (jj.f must yield 



The pre-selected state also yields 4 



&y = 1 (3.42) 



r^f + f'al = 0. (3.43) 



Thus 



{^f° + fv 2 x }{i rj>|f° = -i) - 1 ii>|f° = +i)} = 
{-i U)if° = -i) + 1 ri>i» - {i n>i> - 1 U)if° = -m 



(3.44) 



4 This is easy to see because i x a\ | ti) |f° = -1) -f | Tj)|f° = +1) and^f°| Ti)|f° = -1) -► -| il 
|f° = -1) and f'al\ lj)|f° = +1) - -| H)\f° = -1) and ^f°| U>|f° = +1) - | tj>|f° = +!>• 
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We can deduce outcomes for un-performed measurements of a\: 

&l = -1. (3.45) 

(we have assumed that the ancilla is not disturbed to obtain this conclusion). We 
can also deduce outcomes for un-performed measurements of f 1 (assuming that the 
first particle is not disturbed): 

f 1 = -1. (3.46) 

Once again, we see a violation of the product rule [2H]: from eq. 13 .441 we deduce that 
f 1 ^ = —1, but if constructed individually from eqs. 13.461 and 13.451 l x o\ = +1. This 
"conclusion," however, relies on counterfactual statements, since not all the required 
measurements eqs. I3.43H3.461 can be performed simultaneously without disturbing 
each other. However, we can perform WMs on all these statements simultaneously 
(see [ECfl . 

WVs in the EPR situation can also be seen in the instant example if we consider 
this PPS (instead of a Bell-state, we measure a definite state of the ancilla). We see 
the identical non-classical results as seen in the previous section. First, we define the 
following projectors 

N ++ the projector on the state a\ — 1, and f 1 = 1 
iV + _ the projector on the state a\ = 1, and f 1 = — 1 
N_ + the projector on the state a\ = —1, and f 1 = 1 
N the projector on the state a\ = —1, and f 1 = —1. 

From Theorem 1 and 2, we can deduce the following: from the post-selection a\ = +1, 
we can deduce that f 1 = —1, i.e. eq. 13.461 The different ways to obtain f 1 = — 1 are 

given by 7V + _ (i.e. a\ = 1, and f 1 = —1) and N (i.e. d\ = —1, and f 1 = —1) and 

therefore we can deduce the conservation relationship: 

iV+_ + M_ = 1 (3.47) 

Now b\ = —1, and thus a\ ^ +1 is characterized by: 

N+* = N ++ + iV+_ = (3.48) 

In terms of the box analogy, this is how many ways that particle 1 can be found in 
box 1. In addition, if f 1 = —1 then f 1 ^ +1 thereby giving: 

iV ++ + i\L+ = (3.49) 

That is, in how many ways can particle 2 be found in box 2. Furthermore, since eq. 
13.491 equals eq. 13.481 we can deduce: 

iV + _ = AL+ (3.50) 

From the post-selection, we know that an ideal measurement of a^f will yield +1, 
therefore, using eq. 13.431 we can deduce that f 1 *^ = — 1 (which means that both 
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particles cannot be found in the same box), i.e. 

£^+f^=0. (3.51) 

=+i =>=-i 

Next we ask how to obtain f <r* = — 1. To obtain this, it cannot be that f 1 ^ = +1 
and thus f 1 and a\ must be opposite in sign. This can be characterized by: 

N ++ + JVL_ = (3.52) 

Subtracting eq. 13.521 from the following identity 

iV ++ + M_ + iV+„ + iV_+ = +1. (3.53) 

we obtain: 

iV+_ + M + = 1 (3.54) 
From eq. I3.5UI and eq. 13.541 we can deduce: 

N-+ = iV + _ = i (3.55) 
Plugging this value into eq. 13.481 we can deduce that: 

N ++ = ~\ ( 3 - 56 ) 
Finally, plugging this into eq. 13.521 we can deduce: 

N- = \ (3.57) 

Entanglement in a pre-selected state of 2 particles is isomorphic to an entanglement 
in our 2 virtual particles. Thus, if we look at the right variables, then the BKS setup 
can be seen to be related to EPR. 

The situation analyzed above is general and also points to the Peres/BKS-example |49j : 
in summary, consider a pre-selected state \^epr) = 775 (| Tz)l il) ~ I \ tz)) f° r which 
the following identity holds o\a 2 + &l&y = 0. In addition, \^epr) is also an eigen- 
vector (with eigenvalue —1) of the following operators <3"^a^, cr^a^, <j\o 2 z . It's easy to 
see that a noncontextual-HVT cannot assign values consistent with these operator 
relations: 

V^alal) = V^al) = = -1 (3.58) 

Consider the commuting observables A\ = o\a^ and A 2 = ^1p\- We know that 

A X A 2 = a l x a 2 y a 2 x al = a 2 z a\ = -1 (3.59) 

in the singlet state. The assumption of non-contextuality is that value assignments 
can be made to eq. 13.591 even when these assignments are taken from a different 



23 



context: e.g. assigning values from eq. 13.581 we obtain: 

V^l)V^al)V^al)V^al) = (-1)(-1) = +1 (3.60) 

but experimentally, we obtain —1 from eq. 13.591 a contradiction. Thus a noncontextual- 
HVT is impossible. However, now we can probe this state by post-selections and 
obtain a\.ai = — 1 even though a\ = — 1 and &y = —1. This, again, can only be 
done with PPSs since in a pre-selected only system, for 2 commuting observables, 
the product rule is satisfied in contrast to PPSs. In addition, for this type of contex- 
tuality with PPSs, either the pre-selected or the post-selected state must be the EPR 
state (with an additional assumption of invariance under exchange of particles). 

4 PPS AND CONTEXTUALITY IN HIGHER 
DIMENSIONS 

A future article will show how the feat presented in the previous section can be 
done in higher dimensions, e.g. to the GHZ state [121 OH]- 111 this case we see that it 
is not possible to replace the spin operators by ordinary numbers (which is what a 
noncontextual-HVT attempts to do). However, assignments can be correctly made 
to each of the cr's if post-selection is utilized. Once again, the limitations to these 
assignments can be seen by using the structure of TSQM: if we try to measure all of 
these observables together, then some of the values will be assigned in the diagonal- 
PPS sense, and therefore measuring these observables will cause a disturbance even 
though they commute. 

Consider the GHZ case of 3 spins pre-selected in the state: 

= tfM) ~ llllll) (4.61) 

Consider that the pre-selected state is an eigenstate of the following operators: 
A\ = &ld-yd-y, A 2 = ayalcfy, and A 3 = a^a^al with eigenvalue +1. Also, the prese- 
lected state is an eigenstate of: 

A 4 ee a l x a 2 x al (4.62) 

with eigenvalue, —1, and finally A 1 A 2 A 3 = — A 4 However, because Ai, A 2 , A 3 , 
and Ai commute, and because a l x , a y , and a* commute one may ask whether it 
is possible to also satisfy the above equations by replacing the spin operators by 
ordinary numbers a\ = ±1, <7* = ±1, which is what a non-contextual HVT attempts 
to do. Without post-selection, this cannot be done because the assignments are, 
once again, inconsistent with the multiplicative structure of the observables because 
AxA 2 A 3 = 1, a contradiction. 

4.1 GHZ and PPS 

Assignments can be correctly made to each of the cr's if post-selection is utilized with 
limitations again arising from diagonal assignments. E.g., consider a post-selection 
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of the three particles by measuring their x component with a x = — 1: 

(Vfinl = (ililil I (4.63) 

From the post-selected state we strongly know that d'la^.a^. = — 1 and from the 
pre-selected state we strongly know that a l x a y ay = +1. Thus statements of the form 
o~ y &y = — 1 can only be made when information is used from both the pre-selected 
vector and from the post-selected vector. For example, if a measurement of o^a^ is 
performed, then we will definitely find cTyO^ — — 1. However, if we attempt to perform 
a second measurement, e.g. of o-y&y then we will not find fryfry = — 1, because the or y &y 
measurement will destroy the pre-selected vector which contains the a l x a y ay = +1 
information that the &^& y measurement depends on. This disturbance occurs even 
though CFydy and fryfry commute! 



4.2 Weak Values in GHZ state 

We may also consider WMs of the GHZ observables. With the post-selection (\l/fin| = 
(lililx l> then in the intermediate time we can replace a\ = &l = a x = —1 and taking 
the inner product with this post-selection l^fin), we then find: 

^ - = -1 (4-65) 

Using again the analogy with particles in boxes, eq. ()4.64|) means that particle 2 and 
particle 3 are not together in the same box, while eq. ()4.65|) means that particle 1 
and particle 2 are not together in the same box, and eq. (|4.66J) means that particle 
1 and particle 3 are not together in the same box. But we only have 2 boxes, so if 
1 and 2 are not in the same box and 1 and 3 are not in the same box, then 2 and 
3 must be in the same box. It is clear that the above equalities cannot be satisfied 
simultaneously by replacing the operators for classical numbers taking the values ±1. 
To simplify this analysis, we define 

(N +++ ) w = I TwXTw li ® I TwXTi/ I2 ® I Is , (4-67) 

where the 2 boxes are denoted by ± referring to the spin component along y. Using 
Theorems 1 and 2, it can be shown that: 

(N +++ ) w = N = - 1 - (4.68) 

and 

(N ++ _) w = (N__ + ) M = (N + _ + ) w = (N^) w = ... = ~ (4.69) 

We have thus shown how to obtain a non-classical negative triplet WVs, which again, 
cannot be reproduced by a noncontextual-HVT. 
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5 CONCLUSION 

Mermin's results reviewed in §'s 13.2. 11)3. 2.41 were characterized by Mermin as "...what 
follows is not idle theorizing about 'hidden variables'. It is a rock solid quantum me- 
chanical effort to answer a perfectly legitimate quantum mechanical question." (2Zj 
BKS showed that noncontextual-HVT's are not possible in general. Therefore, an 
interesting approach to HVT's and BKS is whether anything new can be learned 
about experimental situations, in the same spirit as Shimony's apt phrase "exper- 
imental metaphysics." For example, Bell's Theorem led to interesting experiments 
which tested the notion of whether quantum entanglement could be stronger than 
classical correlations. Another example is Hardy's-paradox [33] (HVTs for position 
could not be assigned) which was traditionally "resolved" by arguing that measure- 
ments to verify the paradox could not be implemented simultaneously and therefore 
Hardy's-paradox was purely a formal result without empirical consequences. However 
in [TT| l3Uj. we demonstrated that WMs could be implemented simultaneously on the 
paradoxical statements (with experimental results [SB])- E.g. the electron is always 
on the overlapping path (Nq w = 1) and the positron is always on the overlapping path 
(Nq w = 1), yet they are never there together (N 'q w = 0). Again, non-classical WVs 
were demonstrated [TT] by the negative pair- wise occupation, Npp NOw — — 1. This 
again was a violation of the product rule ll.3l (Nq w Nq w ^ Nq'q w ) and a manifestation 
of diagonal-PPSs and non-classical WVs. Nevertheless, despite their non-classical be- 
havior, WVs do obey a simple, intuitive, and, most important, self- consistent logic. 

In this paper, we have carried this and the program initiated by Mermin further 
and have shown new ways that the "charming elementary mathematics" of BKS can 
manifest empirically. E.g. in the 3-box-paradox/diagonal-PPS of ^21 Pa = 1 if only 
box A is opened, while Pb = 1 if only box B is opened, but if we measure both 
box A and box B, then the particle will not be found in both boxes, i.e. PaPb = 
even though Pa and Pb commute, a violation of the product rule. If WMs are 
performed, then the non-classical result (Pc) w = —1 is obtained. In the Mermin 
case ^3.21 the violation of the product rule and diagonal-PPS is that o^a^o^cH, = — 1 
(given the PPS of fig. EJa) even though separately b\a^ = +1 and o^b^ = +1 and 

non-classical WVs are again obtained, (N ) w = — |. Similar results were obtained 

in the 4-D/EPR case (i.e. crJ.cXy = —1 even though a\ = —1 and a% = —1) and also 
in the 8-D GHZ case. 

How general are these considerations? Consider any set of commuting observ- 
ables A 1 , A 2 , A 3 and any other set of commuting observables Bi, B 2 , B 3 . Suppose 
we perform pre-selection measurements on A\, A 2 , A 3 and perform post-selection 
measurements on B\, B 2 , B 3 , and suppose, e.g. [A 2 ,S 2 ] ^ 0. Products of observ- 
ables, e.g. AiA 2 , can be ascertained solely based on the pre-selection, while products 
of observables, e.g. B 2 B 3 , can be ascertained solely based on the post-selection. How- 
ever, as soon as we change the sequence of measurements of A 2 and B 2 , then it is a 
diagonal- assignment and they will interfere with each other in PPS. Given this, it is 
evident that there are versions of BKS for which we cannot obtain definite results for 
IMs utilizing PPSs. It is not the case that everything that cannot be known by one 
vector can be known by two vectors: there are many sets of operators for which it 
is impossible to assign all the possible eigenvalues via PPS. However, our arguments 
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Figure 9: Weak Value density for 2 particles 



concerning WMs are completely general. 

The results of this article could be used to further exploit the differences between 
classical and quantum information: while the study of the non-classical aspects of 
entanglement (Einstein- Podolsky-Rosen/Bohm) started as a foundational examina- 
tion of HVTs, it was subsequently probed experimentally and used as a resource 
for quantum computation and communication. Similarly, using TSQM ^2], En- 
glert demonstrated a new form of cryptography ^J] and experiments based on the 
optical version of this problem [TH] were successfully performed. With respect to con- 
textuality, BKS is often based solely on the structure of operators, while quantum 
information applications require special states. In this article, we have shown how to 
probe and empirically manifest contextuality through PPS states, thereby opening 
new possibilities of utilizing contextuality in these applications. 

We end with a conjecture: if we start with a non-classical situation as reflected 
in BKS or a violation of Bell's-inequality, then we can always find a post-selection 
which can empirically manifest non-classical WVs. This thereby holds the possibility 
of shifting foundational debates into a new issue because we can now probe non- 
classicality in actual WM experiments. Non-classical WVs follow intuitively from the 
logic of WVs: e.g. consider a generic multi-particle systems in a higher dimensional 
space p(xi,x 2 ) ~ ^*j in {x\, x 2 )\l/i„,(xi, x 2 ) and consider also projections onto space 
and time. In general, p(xi,x 2 ) cannot be measured locally because there is no local 
way of measuring particle 1 at x\ simultaneously with measuring particle 2 at x 2 : 
this would involve a nonlocal Hamiltonian. However, the projection on each line 
can be measured separately: if it is known that particle 2 is at x 2 then this means 
/ p{x\,x-i)dx\ = 1. Similarly, if it is known that particle 1 is at X\ then this means 
/ p(x\,xi)dx2 = 1. However, asking the question if both particles are there together 
is another point in phase space (see fig. EJ). The particle also could have been 
located simultaneously in another position with certainty, but then we need to place 
a negative number somewhere else in order to satisfy the global constraint, i.e. the 
integral of the WVs has to add up to 1 (because there is just a single particle along 
each line), but the individual numbers at each point can be arbitrary. IM outcomes 
reveal an integration along just one line, but the WV densities are not just those 
lines. E.g. in the Hardy case the seeming contradiction that both particles are there 
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individually but are not there together is resolved by a negative number of particles 
at another point in phase space. 

While there is an operational or experimental meaning to a density over a set of 
commuting observables, as shown in £ 12.2.11 there is no experimental meaning for a 
density over a set of non-commuting observables even though such densities may have 
formal utility as an aide to calculation j^j. As a result of this dis-connect between 
densities over non-commuting observables and IMs, there is a similar problem with 
WMs and WVs. An attempt to give an operational or experimental meaning to 
WMs of densities over non-commuting observables will illustrate the same problem 
as demonstrated in 12.2.11 even though in this article we demonstrated the ability to 
define WMs and WVs for a discrete set of non-commuting observables. 
Acknowledgments: JT thanks Yakir Aharonov for many conversations and the 
Templeton foundation for support. 
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